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I, Michael Jerome Bienkowski, Pb.D., hereby declare as follows: 


I. introduction 

1. I am a co-inventor of Asp2 subject matter claimed in various patent 
applic=^ans filed by Pharmacia & Upjohn. I make this declaration to provide information to 
fcc Patent OfEce that may be relevant to patent issues relating to enzymatically active, 
'tomeinbrane-deleted'' forms (ATM) of the Asp2 protein and polynucleotides which 
cKode such protein. When I refer to "I" or "we" in this declaration, I mean me and/or my co- 
svaoton and/or people working under our direction at Pharmacia & Upjohn. 

2. The term "Asp2" is the name that we gave to aspartyi protease 
polynucleotides and polypeptides that we isolated and described in the patent applications. 
At least two human and one murine form of Asp2 are taught in the patent applications. 
TL-ough experiments described in the patent applications we demonstrated that Asp2 exhibits 
prctaoiyric activity towards amyloid precr^or protein (APP) involved in processing APP into 
j^vicid be* (Ap), a peptide implicated in Alzheimer's Disease pathology. 

H. Ciouing of Asp2 and Identifying the Asp2 transmembrane domain 

3. My co-inventors and I performed and/or directed experiments which 
resulted in the identification and cloning of human Asp2 cDNAs. Our earliest experiments 
d-d not immediately yield full-length Asp2 cDNAs. We first obtained and sequenced two 
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-arc;*] clones denoted as clone 4386993 (hereinafter '438) and clone 2696295 (hereinafter 
'265). As explained in cur patent applications, Cloi.c '438 contains additional codons 
sequence at its 5' end relative to clone '269, but Clone '269 contains 25 additional codons (75 
basepairs) as an internal iiisertion relative to Clone '438. (These 25 codons represent the 
c:5erence between the leng and short forms of full length human Asp2 in Figures 2 and 3 of 
the patent applications.) 

4. After we sequenced the '43 S and l 269 clones we aligned the sequences 
with sequences of other aspartyl proteases as part of our analysis of them. From these 
alignments and other analysis we deduced that these sequences were incomplete cDNA 
sequences that were truncated at the 5' end (the amino-terminus of the encoded polypeptide). 
Coizputer-aided analysis of the predicted amino acid sequences indicated that the predicted 
amino acid sequence encoded by both '438 and '269 contained the DTG/DSG sequences 
indicative of the aspartyl protease active site, and were complete to the carboxyl-tenninus of 
the encoded polypeptide. 

5. By analyzing the partial Asp2 sequence from the '438 and '269 clones 
described in paragraph 3, we deduced that Asp2 contained a transmembrane domain. Our 
U.£. Provisional Application No. 60/101,594, filed September 24, 1998, describes the 
analysis as follows: 

Routine computer-aided analysis of the predicted amino 
acid sequence of Hu-Asp2a and Hu-Asp2(b) for secondary 
structure motifs resulted in detection of a predicted 
transmembrane domain in each polypeptide, which corresponds 
to Hu-Asp2(a) amino acid residues 367-392 of SEQ ID NO: 4, 
and of the sequence given in Figure 2, and to Hu-Asp 2(b) 
amino acid residues 392-417 of SEQ ID NO: 6, and of the 
sequence given in Figure 3. 

(See U.S. Provisional Application No.60/101,594 at p. 20.) 
As I explain in greater detail below, the stated location of the transmembrane sequences 
(367-392) and (392-417), through an inadvertent error, do not correspond to the 
transmembrane regions of the full length human As?2(a) and Asp2(b) proteins shown in the 
Figures, and standing alone, these numbers would not serve as a basis for identifying the 
transmembrane region of the human Asp2 sequences. However, our routine compuier-aided 


in ft*. w=il us to identify the Asp2 r^smembrane region, and a molecular 


^^of ^C^ility who read the zpplication and (through the guidance of the 

performed his/her own routine computer-sided analysis would have identified cc 
corset location of the transmembrane region in our As?2 sequences. 

6. Through our continued research we ultimately cloned additional 5' 
( ,--o terminal) cDNA sequence for the two human Asp2 enzyme isofonns. As reported in 
extent applications, the longer fulllength human Asp2 cDNA has 501 codons.' (Figure3 
of 4 patent applications.) As correctly reported in our 1999 patent applicatums, the 
trcKsofeau domain of this Asp2 clone spans approximately residues 455 to 477 of the 

f ill langa Asp2 sequence. 

7. Looking baclt, 1 believe that fee inadvertent error in the 60/155,493 
application occurred because our research team had performed some of the routine computcr- 
ied analysis onapanial Asp2 sequence from the '438 clone, and reported the data fronts 
^ysis for the full length Asp2 done in the patent application. The analysis of the partial 
sequence from the '438 clone indicated that the transmembrane domain corresponded 
approximately to residues 367-392 of the partial sequence. (See Exhibit A hereto, which is a 
computer-assisted analysis of Asp2 (clone '438) sequence for possible transmembrane 
domains, performed prior to September 24. 1998, which indicates a likely TM region at about 
367-392 ofthe sequenced analyzed.) I believe that the numbers from mis analysis of the '438 
pzrtld sequence were reported in the 60/1 55,493 application for the full length short form 
(Figure 2) of human Asp2. 2 Since the patent application reported the full length Asp2 
screes, the numbers that were generated using the '438 done partial sequence should 
have been adjusted upward for the prat application to account for the extra codons at the 
banning of the full length clone that were missing from the '438 clone partial sequence 


An Asp2 splice variant described in ourpatent application has 476 codons by 
v'-i. of the internal deletion of 25 codons described above m paragraph 3 (Figure * of the 
appl caS As reported correctly our 1999 patent ^applications the transmembrane 
dSln of this sequence corresponds approbate'* to residues 430^52. 

An upward adjustment of these numbers (by 25 codons) was used for the Ion- 
form of Asp2 (Figure 3). 


a^Livzed. But, "trough inadvertent error when preparing the patent application, this 
adjustment was r.ot made. 

8. la my opinion, this error would have been apparent to an average 
scions: in the field who evaluated the application, as would the proper correction of the 
error, in particular, it is commonly understood by molecular biologists that a transmembrane 
coi^in is characterized by a stretch of about 20-25 mostly hydrophobic amino acids. When a 
biologist read the application's teaching that Asp2 had a transraembrane domain near the 
c^ocxy-teiminus and then examined the sequence to look for that transmembrane domain, it 
would have been readily apparent that the transmembrane domain was at about residues 455- 
477 (of Figure 3), and not residues 392-417. 

IH. Invention-related activity for As?2 ATM polynucleotides and polypeptides. 

9. The attorneys for Pharmacia £ Upjohn have asked me to authenticate 
ano discuss certain documents relating to our Asp2 invention. 

1 0. Exhibit B hereto comprises excerpts from our U.S. Provisional 
Application No. 60/101,594. These excerpts establish that, on or before our filing date of 
Seoteaber 24. 1998, we had possession of two human Asp2 cDNA and deduced Asp2 amino 
aoid sequences (Figures 2 and 3) and determined various Asp2 structural features, including 
the presence of a transmembrane domain. It shows £at we contemplated vectors and host 
co:l3 for recombinant production of Asp2 polypeptides and enzymatically active polypeptide 
fegaents (see, e.g., pp. 4, 5, and 9), and that we contemplated Asp2 antibodies (see. e.g.. pp. 
4, 12.) H shows that we contemplated expression cf Asp2 in a variety of expression systems, 
athsdasgprokaryoas such as E. coli (pp. 9 and 10), yeasts such as S. cerevisiae (pp. 9, 1 1), 

iigher euksryotes such as insect cell systems and mammalian systems, including COS 
cells, CHO cells, and human cells (see, e.g., pp. 9, 1 i-12). 

i 1 . Exhibit C hereto is a copy of & page from a Pharmacia & Upjohn 
inxroffice memo from prior to our September 24, 1998, filing date, containing a report on the 
Human Asp2 project. Among other things, this except shows that, prior to September 24, 


4 


1SW we had engineered die Asp2 open reading to (ORF) from Ac '438 and '269 clones 
to remove the trartfmembrane domains, and that we bad inserted these ATM constructs into 
~ 5. co// expression vector pQE30. 

12. Exhibit D hereto are copies cf pages from a Pharmacia & Upjohn 
Oratory notebook. These pages establish that, prior to December 31, 1998, we had made a 
husa Asp 2 ATM construct confining the DNA sequence coding for human As ? 2 amino 
Ld, M54 (long foi= shown in Figure 3 of patent applications) in a baculovirus expression 
vccxrpVL 1293 (hu Asp 2 ATM pVL 1393) for expression in SF9 insect cells. This 
cessruct was sent for sequencing and the sequence was confirmed. Exhibit E hereto are 
cc;.ics of pages from a Pharmacia & Upjciin laboratory notebook. These pages establish that 
air December 31, 1998, we had made similar contacts with 6-hisudine tags to facilitate 
previa purification. 

13. Exhibit F hereto are copies of pages from Pharmacia & Upjohn 
laboratory notebooks which show that, prior to March 26, 1999, we had expressed human 
Ai?2 ATM pro-vein (without A secretase enzyme activity) in £ Coli to make antibodies for use 
is Siting of recombinant expression of human Asp2 ATM in other cell types. 

14. Exhibit G hereto are copies of pages from a Pharmacia & Upjohn 
Oratory notebook which show that, prior to March 26, 1999, we had made, isolated, and 
sc^ed-up preparations of viral plaques for production of a human Asp2 ATM construct in 
S?9 insect cells. 

15. Exhibit K hereto contains copies of pages from a Pharmacia & Upjohn 
laboratory notebook which show that, after December 3 1 , 1998, and prior to June 15, 1999, 
aid prior to September 23, 1999, the scale-up results from SF9 were analyzed. Exhibit 1 are 
cc ? isi of a Pharmacia ar.d Upjohn laboratory notebook showing a gel depicting the results of 
such analysis. A clean band of human Asp2 ATM expressed protein was identified by 
Wes-am bio; as shown in the notebook. This band is believed to contain active human ATM 
Aip2 1-454 protein expressed in the SF9 system. 
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1 6. Exhibit J are copies of pages from a Phannacia & Upjohn laboratory 
notebook showing that, after March 26, 1999, but prior to September 23, 1999, we excised 
the Asp2 ATM coding segment from the pVL 1393 vector described above, inserted i: 
into PIZ vector, and expressed this Asp2 ATM constat in Higi Five CeUs. We tested this 
recombinant human Asp2 ATM protein and showed that it retained human Asp2 enzymatic 
activity. This work is also generally described in the patent applications that we filed on 
Sc>:^ber 23, 1999, including PCT/US99/20S81, U.S. Provisional Application No. 
60/155,493, and U.S. Application Serial No. 09/404,133. 

1 7. As shown in part by the representative documents referred to in the 
preceding paragraphs, during the period prior to September 24, 1998, until September 23, 
1999, we were engaged in substantially continuous activity to make enzymatically active 
buiian Asp2 protein lacking a transmembrane domain, using materials and methods that we 
had contemplated in our September 24, 1998, patent application and/or had produced by that 
Somber 24, 1998 filing date. 

IV. Certification 

18. I hereby declare that all statements made herein of my own knowledge 
are tn.e and that all statements made on information and belief are believed to be true; and 

f urier that these statements were made with the knowledge that willful false statements and 
the like so made are punishable by fine or imprisonment, or both, under Section 1 001 of Title 
1 S of the United States Code and that such willful false statements may jeopardize the 
validity of the application or any patent issued thereon. 


Date: November 30, 20C1 

Michael 


;hael Jerome EicLkowski, Ph.D. 
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Figure 3 Alignment of Prosite Aspartyl protease consensus sequence with active site motifs 
in Hu_Asp-2 

[LIVMFGAC] - [LIVMTADN] - [LIVFSA] -D- [ST] -G- [STAV] - [STAPDENQ] -X- 
[LIVMFSTNC] -X- [LIVMFGTA] 

N-Terminal motif: ILVDTGSSNFAV 

[LIVMFGAC] - [LIVMTADN] - [LIVFSA] -D- [ST] -G- [STAV] - [STAPDENQ] -X- 
[LIVMFSTNC] -X- [LIVMFGTA] 

C-Terminal motif: S I VDSGTTNLRL 


Figure 4 
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v a- on, the nucleic acid molecules comprise a polynucleotide 
U a p.fetxed a*—* *» ^ ^ JMM0 of 

^ a nocfcoU* «*~» of SEQ ID NO:3. «o*» Hu- 

^ and ««-. «*" ^ comprising , pCynncfcoUd, wh*n 

- — provide, an - ^ ^ 

Hu-Asp-2(b), o, taj-. ^ ^ ^ homobgy , 0 wMe 

international apphcauon WO 

™ "ITIT^ «* — ^ been indeed, a* 

recomWna* nKthods of «m . » J 

--rrr rr-rr si — - r 

SEQ ID NO* SEQ ID NO:4. » SEQ * >NO. ^ J ^ - 

polyclonal and monoclonal, that tnnd specially any 

H^i^-^^^rrT---- of an a g e« 
The invention also provides a method tor 

. r ft fH„ Asd-1 Hu-Asp2(a), and Hu-Asp2(b). 
modulates the activity of any of Hu-Asp-i,nu v 

X" — CSEQ » an, p^ - 

- - C"™^ <SEQ ID - ^ - 

' Am0M The predicted transmembrane domain of 

acid sequence (SEQ ID NO:6) of human Asp2(b). The prea 

after addition of the di-Lys motif using '•patch" PCR. 
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^ present .ovenu - sen ^ ^ proK _ such u 

pepsin and «- possess a two do _ ^ ^ DTQ 

m me proenzyme (pro*™. W-*-^ rf ^ ^ ^ m0|if 

^r cleavage of the N-temunal prodon^n Th for novcl 

^party. proteases. ^7™^ L^hetooshonasequencemspanthe 

method ismscandatahasesofhypotnetiea,., 
active site moufs. Thepreferr , computer melh od to 

ambled prouin coding ^ ^ abases. The method was used 

„ identify seven candidate asp^ P ^ ^ ^ ^ ^ 

^vespUcevananuo^^. ^ ^ ^ ^ ^ 

to another embod-ment, d. > _ P o{ 

ha. aspanyl protease 1 (Hu-Asp H ,-Asp2(b). As used herein, all 

-—-rl"t— .I U ,. bo* Hu-Asp2Ca> and H, 
Kta l^"Z ^re^ -st abundant* inpancreasandpro^ Uss.es. wn.Hu- 

«. ^..hereof which eihibil aspartyl protease activity. 
^ weD as fragments thereof wtucn Hu-Asn2(a) and Hu-Asp2(b) share 

Themedicted amino acid sequences of Hu-Aspl, Hn Aspzw f 

, nreviously identified mammalian aspartyl proves such as 

» ^ ,to ^ 0 ^ B P :rpL D .caU«psmE.and^P.B. S «,W/. 
pepsinogen A. pepsinogen B. cafcep . m chanctcrized by the 

Oh. 1A inn*. 52:(Suppl. 210 5-22 (1992)). ^ 2 

presence of a duplicated DTG/DSG sequence moot The nu Asp 
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• Tr.„tlv different from a naiive Hu-Asp polypepiide in molecular 
^y* similar to or s.gmficantly different fr ^ systems 
weight and glycosylate pattern. Express.on of Hu-Asp in 

will provide non-glycosylated Hu-Asp. ^ ^ 

*~ ^ 0 7 cluomatograph,. Pbosphocen^ 

ethanol precipitation, anton or catton * ^ chromatography. 

hy(W a.Ue chromatography fer foa 

r *TT^ vjors comprises - polynucleotide 
^ preS eot — w ^^ nu— withsu chvec,o B .A ny o ft he 

polynucleotide molecules ol tne _ prop agation in a host Because 

s u. invention also proves Hu-Asp polyp ^^^ecxon 

include DNA encodmg any of 0>e Hu Asp porypep 

t0 suKat^pUo^ or transanal ^ 

„, apP ropnate sequences ^rJaiory sequence functionally relates to the PNA 

25 sequence. Qf ^fcoUde molecules 

* ~* ^ypepUdes. wul of «u«e depend upon 

0* bos, cel. tn winch the vector win ^ ^ of 

ta, which the Hu-Asp polypepude . to be which 
,0 Hu-Asp polypeptides include prolcaryotes, yeast, and higher eukaryotic * 

te discussed telow. hoa ^ ^ ^ be fusion 

^ Hu-Asp P0^ «0jee^ Sochrc ^ myb e included to 

proteins which include regions from heterologous v 
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^ €Jf .. secretion, improved stability, or facilitated purification of .he polypeptide. For 
example a sequence encoding an appropria-e signal peptide can be incorporated into 
expression vectors. A DNA sequence for a signal peptide (secretory leader) ma, be fused 
in frame ,o the Hu-Asp sequence so that Hu-Asp is uanslated as a fusion protein comprising 
to signal peptide. A signal peptide that is functional in the intended host ceH promotes 
extracellular secretion of the Hu-Asp polypeptide. Preferably, the signal sequence wul be 
ck aved from the Hu-Asp polypeptide upon secretion of Hu-Asp from the eel Non-limiting 
example, of signal sequences that can be used in practicing the invention include the yeast 
1-fac.or and the honeybee melatin leader in sB insect cells. 

to a preferred embodiment, the Hu-Asp polypeptide wul be a fusion protein winch 
i^des a heterologous region usee to facilitate purification of the polypeptide. Many of the 
available peptides used for such a function aBow selective binding of the fusion protein to a 
binding partner. For example, the Hu-Asp polypeptide may be modified to comprise » 
peptide to form a fusion protein which specifically binds to a binding partner, or peptide tag. 
Non-limiting exanp.es of such peptide tags include the 6-Hs «a g 7 ^oredoxin ug 
kemaglutihin tag. GST tag, and OmpA signa! sequence tag. As win be understood by one of 

^ u . Mrtn „ r wh ich recognizes and binds to the peptide may be any 
skill in the art, the binding partner wmcn recognn«> o« 

molecule or compound including metal ions (e. g .. metal affinity columns), antibodies, or 
fragments thereof, and any protein or peptide which binds the peptide, such as the FLAG 


20 tag 


Suhable host ceDs for expression of Hu-Asp polypeptides include prokaryo.es. yea*, 
and higher eukaryotic ceus. Suitable prokaryotic hosu .o be used for the expression of Hu- 
Asp include bacteria of the genera Escherichia, Bacillus, and Sal^lla. as well as 
members of the genera PseuJa^nas. Snepu>myccs, and Smphylococcus. For expression 
a in, e . £ ccU. a Hu-Asp polypeptide may include an N-terminal memionine residue to 
facilitate expression of the recombinant polypeptide in a prokaryotic hosu The N-fcrminal 
Me. may optionally then be cleaved from the expressed Hu-Asp polypeptide. 

Expression vectors for use in prokaryotic hosu generally comprise one or more 
pheuow* select marker genes. Such genes general* encode. e.g.. . pnMein that 
M confers antibiotic resisunee or Ota. supplies an auxotrophic requirement A wide variety of 
such vectors are readDy available from commercial sources. Examples include pSPORT 
vectors, pGEM vec.ors (Promega). pPROEX vectors (LTI. Bethesda. MD). Bluescrip, 
vectors (Stratagene). and pQE vectors (Qiagen). 
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Hu-Asp may also be expressed in yeast hosl cells from genera including 
SaccHaromyc*,, HcHia, and Klu^cs. Preferred yeas, hosts are S. c<~ and P. 
P as«,rU Yeas, vectors will often contain an origin of replication sequence from a 2T yeast 
Plasmid. an autonomously replicating sequence (ARS), a promoter region, sequences for 
po.yadeoyiation. sequences for transcription termination, and a selectable marker gene. 
Vectors replicable in both yeas, and £ cp/i (termed shu.de vectors) may also be used. In 
addition ,o me above-mentioned features of yeas, vectors, a shut* veaor wffl also inch.de 
recces for replication and selecuon in E. coU. Direct secretion of Hu-Asp polypeptides 
expressed in yeas, hosts may be accomplished by <he indusion of nuclide sequent 
encoding me yeas, I-fac.or leader sequence a, the 5' end of the Hu-Asp-encoding nucleotide 

^"""Lc, hos, cell cuhure systems may also be used for me expression of Hu-Asp 
polypeptide, In a preferred embodiment, the Hu-Asp polypeptides of me invention are 
expressed using a baculovinH expression system (*« Example 3). Further information 
regarding me use of baculovirus syaems for *e expression of heterologous proteins » 
insec, ceUs are reviewed by Luckow and Summer, Bio^ec^ology 6:47 (1988). 

in anomer preferred embodiment, 0* Hu-Asp polypeptide is expressed u. 
^aUan host cen, Non-limiting examples of soluble mammaUa. ceE Unes include Ae 
COS-7 Une of monkey kidney ce* (Ouzman * al.. CeU 23:.75 (.98,)) and Chinese 
„ ovary (CHO) ceU, Preferably, human embryonic kidney cel. Hue, 293 is used for 
expression of Hu-Asp proteins. 

The choice of a suirable expression vec,or for expression of the Hu-Asp P*P^ 
of me invention win of course depend upon *e specific mammalian hos, ceu to be used, and 
* wtthin me skffl of *e ordinary artisan. Examples of sui.able expression vecors inch.de 
, pcDNA3 (inviti-ogen) and pSVL (Pharmacia Bio«ch). A preferred vector for expression of 
Hu-Asp po.ypep.ides is pBK-CMV (S,ra,agene). Expression vectors for use in mamrnataan 
hoa ceus may inch.de transcriptional and translation*. con.ro. sequences derived from v*U 
genomes. Commonly used promoter sequences and enhancer sequences which may be used 
1 te present invention inch.de, bu, arc no, Med to. .hose derived from human 
0 cy,omega.ovirus (CMV). Adenovirus 2. Po.yoma viru, and Simian v»us 40 (SV40). 
Memods for me consuuetion of mammalian expression vectors are disdosed. for exampfe 
m okayam, and Berg (|ML CM BioL 5:280 (.983)); Cosma. « al WoL l^L 
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25:935 (1986)); Cosman « al. Mature 572:768 (1984)); EP-A-0367566; and WO 
91/18982. 

The polypeptides of the present invention may also be used to raise polyclonal and 
monoclonal antibodies, which are useful in diagnostic assays for detecting Hu-Asp 
polypeptide expression. Such antibodies may be prepared by conventional techniques. See, 
for example, Antibodies: A Laboratory Manual, Harlow and Land (eds.). Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., (1988); Monoclonal Antibodies. 
Hybridomas: A New Dimension in Biological Analyses, Kennet ex al (eds.), Plenum Press, 
New York (1980). 

The Hu-Asp nucleic acid molecules of the present invention are also valuable for 
chromosome identification, as they can hybridize with a specific location on a human 
chromosome. Hu-Aspl has been localized to chromosome 21, while Hu-Asp2 has been 
localized to chromosome 11. There is a current need for identifying particular sites on the 
chromosome, as few chromosome marking reagents based on actual sequence data (repeat 
polymorphisms) are presently available for marking chromosomal location. Once a sequence 
has been mapped to a precise chromosomal location, the physical position of the sequence 
on lh e chromosome can be correlated with genetic map data. The relationship between 
genes and diseases that" have been mapped to the same chromosomal region can then be 
identified through linkage analysis, wherein the coinheritance of physically adjacent genes is 
determined. Whether a gene appearing to be related to a particular disease is in fact the 
cause of the disease can then be determined by comparing the nucleic acid sequence between 

affected and unaffected individuals. 

In another embodiment, the invention relates to a method for the identification of an 
agent that increases the activity of a Hu-Asp polypeptide selected from the group consisting 
of Hu-Aspl, Hu-Asp2(a), and Hu-Asp2(b). the method comprising 

(a) determining the activity of said Hu-Asp polypeptide in the presence of a test 
agent and in the absence of a test agent; and 

(b) comparing the activity of said Hu-Asp polypeptide determined in the 
presence of said test agent to the activity of said Hu-Asp polypeptide 
determined in the absence of said test agent; 

whereby a higher level of activity in the presence of said test agent than in the absence of 
said test agent indicates that said test agent has increased the activity of said Hu-Asp 
polypeptide. 


12 


FIGURE 2 


70 »0 


130 -GAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 
LPRt- i. L' EE PEEPGRR" " " 

VEMVDNLRG K q S» « « ^ 

VG SPPQTLN I L V 

310 ITTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 
P H P 

370 GG AAGGG TGTG T ATG TGC C C T A CACC C AGGG CAAGTGGG AAGGGG AG 


GTOQQTJCTJXCCCj^ T 

Lcgggac^aac^^^ £ 


CTOOOClSSWr^^ 

L G T 49 S L 510 530 


T E SO K F- J 1 N' G S""N""W *~E^""G I V- ■ 

gaagJ^ctc^^ 


670 690 


ACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTA1TATGAGG 
TGSLWYTPI R R E W Y Y E V q 

730 .— 750 
3AGA' 

R V E I « « w « « « i0 - " - 830 


CG^AGATCAA^^ 

VEINGQDL | iq M 830 

AGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAAGIGTTTGAAGC^ 
SI VDSGTTN I. R L 890 


N L*R L P K K V g F o E A A 

CT^CATC^ 


SQS STGTVM O A V 1190 
rBOAlSScC^ 

PDRARKRIS F A V s» i25Q 
T R ,290 «10 

TGCGCCC^CTTCATGCTOCCACTCIGCCTCATGOTGTGTCAOTOQCGCTGCCTCCGCTOC 


6177 

C A ?„S M L P L C L « V C Q W K i4 C Q L R C 

CTGCGCC^GCAGCATGATGACTTTGCTGATGACATCTCCCTGCTGAAGTGAGGAGCCCCA 
L R ? q H D D F A D^D I S L L 1^ 

raGCAGAAGATAGAGAircccc^^ 

ctctgccttcatggagaaggaaaaggctggcaaggtgggttccagggac 

GAA acaga1aWgaagaaagaag^ 
titaagtcg^gaaattctgctgcttgaaacttcagccctga^ 

TTAAAT^TCCAACCCAAAGTATrCTTCT^TTAGTTTCAGAAGTA^ 

ccxtoctoXcju^^ 

(^ctcta?aa\caagcctaacaito^aaagattcx:ctcttgaaaaa^ 


FIGURE 3 


VEMVDNLRG K S <» « 29Q 

VGSPPQ™ 350 
.^^CCCCCACCCC^C^^ACTACCAOJ^T^OCACA 

V G A A P H P F. I. H 9Q K 41 „ 


YRDLRKGVY V P * « 4?0 

t <~ T D L V S I r " _ 


CTC 


AAITESDKF PI » H0 

LoTCA^-C^A^WA^T^TCOAC^AACaAC 
EVl I.VRVE X * *• * 890 


LIIaaI^Laaa^atc^ 

V FEAAVKS I J » » a 1010 

^^^^ssacacca^^ 

GFWLGEQLV C W « 10?0 
r ^ aT A^A T AA^^TT«=AACCA^C^ TC ACC 

ATecTTCCGCAG«ATACCTGC<^CMT^ 

ILPQQ YLRP ,170 119° 
.ACAA^CATC^CAOTCA^CC^AC^A^A^^TCAT^ 

YKFAISQSS T »- * 1250 

1210 m^irrrw a ivfiftnc CCG AAAACGAATOGGCTTTGCTGTCAGCGCTTGC 
^^TACGTI^C^AICC^CCGAAAAC j q p ^ fi x c 

HVHDEFRTA A v 1J?0 

^C^TACAACATI^ 


E D C 0 » « ! P C * » ^ STL « j4 T o I * Y 

orel * 0 SKcc««««e^^ 

VM A A I GAL 7ft 1490 

c^^-cc^cccAC^ 

rc lrclrQQ » d u l550 

^^aLcccat^cacaacatKacattcccc^^ 

L^IJaacta^cacaca^^^ 
c cacccaIIaaa TC cc«^c™£acaa«^^ 


CTTCGTCACCTCAAATTTAAGTCGOGAAATTCTGCTGC^ * 1850 

1810 „™^J;??r)^GTAlTCTTCITrrcTTAGTTTCACAA 

ACCATICCTrTAAATTCTCCAACCCAAAGTAXiVi 

i"° .^^^TTACC^^TCCCT^TACCCTOOCA^AAO 


TOTCCACCATTCCTrrAAATTCTCCAACCCAAACTATTOTC ^ 
^C^ATCACACOCA^rTACC^C^TCCCT^TACCCT 

1990 » r^GCCTAACATTC3GTGC AAAGATTGCCTCTTC3A 


TGC TTTAG AGACAGGGACTGTATAAACAAGCCTAA 1 
ATOAAA AAAAAAAAAAAAAAAAAAAAAAAA 


INTEROFFICE MEMO 


Page 3 of 5 


coding sequence of Hu_Asp-l has been prepared and the predicted amino acid sequence, aligned with 
both the short and long forms of Hu_Asp-2, is attached. This splice variant of Hu_Asp-l encodes a 521 
amino acid polypeptide including a 27 residue signal peptide so the pro-form of the enzyme contains 76 
amino acid residues upstream of the first active site motif. This upstream sequence also contains a third 
DSG motif. Alignment of the sequence surrounding this upstream DSG with the ProSite motif for 
aspartyl proteases revealed a poor match while the other two DTG/DSG motifs showed a good match. 
Alignment, with Hu_Asp-2 sequences using the Clustal W algorithm highlights two major differences 
between Hu_Asp-l and Hu_Asp-2; the NH 2 terminal extension in Hu_Asp-l is much longer and that 

Hu_Asp-l appears to be more like the long form of Hu_Asp-2. The longest stretches of amino acid 
identity align with the two aspartyl protease active site motifs although other areas of conservation are 
also scored. . 

Finally, the Hu_Asp-l gene was localized to human Chromosome 21 by hybridization to a Southern blot 
containing a series of mouse/human or hamster/human somatic cell hybrids (attached). 

Hu_Asp-2, Maiy provided an inventory of the expression constructs for Hu_Asp-2 (attached). The 
entire ORF of both the short (438) and long forms (269) of Hu_Asp-2 have be engineered into the 
mammalian cell expression vector pBK-CMV. Also, both the short 

and long forms, with the COOH-terminal transmembrane domain deleted, have heenpreparedas NH 2 

terminal 6His-fusions.in the Er<oli expression~vector~pQE3.0. Finally, the entire ORF from the short 
form of Hu_Asp-2 has been cloned downstream of the ecdysone-inducible promoter in the vector pIND 
and in a polycistronic fusion with GFP (pIRESGFP) for mammalian cell expression studies. 


Hu_Asp-3 and Hu_Asp-4 — Queries of the LifeSeq Assembled database with the sequences of either 
Hu_Asp-l or Hu_Asp-2 identified (1) gene bins with exact matches to the query sequences, (2) gene 
bins matching the 5 known human aspartyl proteases [pepsinogen A, pepsinogen C, cathepsin D, 
cathepsin E and renin], and (3) three gene bins with significant homology [242842, 242824, 3951 1], in 
descending order of significance. Translation of the longest assembled templates contained within these 
gene bins revealed that they each encoded polypeptides containing the duplicated active site motif that is 
the hallmark of mammalian aspartyl proteases. Alignment of the predicted amino acid sequences for 
templates 451054.3 and 451034.4 showed that they were very similar with approximately 90% sequence 
identity at the amino acid level (attached). Template 126360 was most related to 451054.3 and 
451034.4, with approximately 70% shared identity. Consistent with the nomenclature initiated 
previously, the genes represented by Incyte templates 451054.3, 451034.4, and 126360 are referred to as 
Hu_Asp-3, Hu_Asp-4a and Hu_Asp-5, respectively. Template 451034.2 appeared to be a splice variant 
of 450134.4 with a 25 amino acid (75 bp) insertion near the C0 2 H-terminus (data not shown). The 

cDNAs that defined the 5'-most sequence of each of these templates were identified, obtained for 
sequence analysis and determination of the tissue distribution of expression of transcripts derived from 
these genes. TTie Hu_Asp-3 probe visualized a single 1 .6 kb transcript that showed a limited expression 
pattern that was expressed at the highest levels in lung, immunological tissues (spleen, thymus and 
PBLs), and kidney (attached). No expression of Hu_Asp-3 transcripts was detected in whole brain while 
a weak signal was observed in several brain regions including the medulla, spinal cord and putamen 
(attached). These results were consistent with the expression pattern determined by EST sequencing in 
LifeSeq Assembled (39 ESTs) which indicated highest expression in the hematopoietic/imnr* ma 
category (41%) and the nervous category being the second highest (16%). The Hu_Asp-4 p 
visualized a similar pattern of transcript size and abundance except that the signal was mos* 
in lung tissue. No transcripts were detected in either whole brain or selected brain regions ? 
conditions used in these experiments. A survey of expression using LifeSeq Assembled (1* 
indicated that 93% of the ESTs that comprise the Hu_Asp-4 template were derived from r 
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